Amendments to the Specification 

The paragraph beginning at page 124, Une 35, has been amended as follows: 
"Percent amino acid sequence identity may also be determined using the sequence 
comparison program NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). 
Th e NCBI BLAST2 s e qu e nc e comparison program may b e download e d from 
http ://www.ncbi.nlm.nih.gov. NCBI-BLAST2 uses several search parameters, wherein all of 
those search parameters are set to default values including, for example, unmask = yes, strand = 
all, expected occurrences = 10, minimum low complexity length =15/5, multi-pass e-value = 
0.01, constant for multi-pass = 25, dropoff for final gapped ahgnment = 25 and scoring matrix = 
BLOSUM62.- 

The paragraph beginning at page 127, line 16, has been amended as follows: 
-Percent nucleic acid sequence identity may also be determined using the sequence 
comparison program NCBI-BLAST2 (Altschul et al.. Nucleic Acids Res. 25:3389-3402 (1997)). 
Th e NCBI BLAST2 sequ e nc e comparison program may be download e d from 
http://www.ncbi.nlm.nih.gov. NCBI-BLAST2 uses several search parameters, wherein all of 
those search parameters are set to default values including, for example, unmask = yes, strand = 
all, expected occurrences =10, minimum low complexity length =15/5, multi-pass e-value = 
0.01, constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix = 
BLOSUM62.- 

The paragraph beginning at page 232, line 30, has been amended as follows: 
--A cDNA sequence isolated in the amylase screen as described in Example 2 above was 
found, by BLAST and FastA sequence alignment, to have sequence homology to a nucleotide 
sequence encoding sarcoma-associated protein SAS. This cDNA sequence is herein designated 
'DNA23020 (see Figure 16). The DNA23020 sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to 
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identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington! 
http:/^oz e man.mbt.waGhington. e du/phrap.docG/phrap.html ). The consensus sequence obtained 
therefrom is herein designated DNA35858. Two proprietary Genentech ESTs were employed in 
the assembly wherein those EST sequences are herein identified as DNA21971 (Figure 17; SEQ 
DD NO:38) and DNA29037 (Figure 18; SEQ ED NO:39).-- 

The paragraph beginning at page 274, line 32, has been amended as follows: 
"A cDNA sequence was isolated in the amylase screen described in Example 2 above 
and is herein designated DNA13199 (Figure 134; SEQ ID NO:332). The DNA13199 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g., GenBank) to identify existing homologies. The homology search was 
performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in 
some cases 90) or greater that did not encode known proteins were clustered and assembled into 
consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington ; http://boz e man.mbt.washington. e du/phrap.docs/phrap.html ). The 
consensus sequence obtained therefrom is herein designated as DNA22778.-- 

The paragraph beginning at page 275, line 31, has been amended as follows: 
-A cDNA sequence isolated in the amylase screen described in Example 2 above was 
herein designated DNA37642 (Figure 137, SEQ ID NO:338). The DNA37642 sequence was 
then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify homologies therebetween. The homology search 
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was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resuhing in a BLAST score of 70 (or in 
some cases 90) or greater that did not encode known proteins were clustered and assembled into 
consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington ; http://bo2 e man.mbt.washington.0du/phrap.docG/phrap.html ). The 
consensus sequence obtained is herein designated DNA4861 5.-- 

The paragraph beginning at page 309, line 25, has been amended as follows: 
--A cDNA isolated in the amylase screen described in Example 2 above is herein 
designated DNA26832 (Figure 220; SEQ E) NO:516). The sequence of DNA26832 was then 
used to search expressed sequence tag (EST) databases. The EST databases included public EST 
databases (e.g., GenBank) and a proprietary EST database (LIFESEQ™, Incyte Pharmaceuticals, 
Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
(Altshul et al. Methods in Enzvmology 266 : 469-480 [1996]). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode proteins were clustered 
and assembled into consensus DNA sequences with the program *'phrap*' (Phil Green, University 
of Washington, Seattle, Washington Washington; 
http://boz e man.mbt.washington. e du/phrap.docG/phrap.html ).-- 

The paragraph beginning at page 311, line 17, has been amended as follows: 
-Human thrombopoietin (THPO) is a glycosylated hormone of 352 amino acids 
consisting of two domains. The N-terminal domain, sharing 50% similarity to erythropoietin, is 
responsible for the biological activity. The C-terminal region is required for secretion. The gene 
for thrombopoietin (THPO) maps to human chromosome 3q27-q28 where the six exons of this 
gene span 7 kilobase base pairs of genomic DNA (Chang et al.. Genomics 26: 636-7 (1995); 
Foster et al.,.Proc. Natl. Acad. Sci. USA 91: 13023-7 (1994); Gumey et al.. Blood 85: 981-988 
(1995). In order to determine whether there were any genes encoding THPO homologues located 
in close proximity to THPO, genomic DNA fragments from this region were identified and 
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sequenced. Three PI clones and one PAC clones (Genome Systems Inc., St. Louis, MO; cat. 
Nos. Pl-2535 and PAC-6539) encompassing the THPO locus were isolated and a 140 kb region 
was sequenced using the ordered shotgun strategy (Chen et al., Genomics 17: 651-656 (1993)), 
coupled with a PCR-based gap filling approach. Analysis reveals that the region is gene-rich 
with four additional genes located very close to THPO: tumor necrosis factor-receptor type 1 
associated protein 2 (TRAP2) and elongation initiation factor gamma ( e lF 4 () {elF4g}, chloride 
channel 2 (CLCN2) and RNA polymerase JL subunit hRPB17. While no THPO homolog was 
found in the region, four novel genes have been predicted by computer-assisted gene detection 
(GRAIL)(Xu et al.. Gen. Engin. 16: 241-253 (1994), the presence of CpG islands (Cross, S. and 
Bird, A., Curr. Opin. Genet. & Devel. 5: 109-314 (1995), and homology to known genes (as 
detected by WU-BLAST2.0)(Altschul and Gish, Methods Enzymol. 266: 460-480 (1996) 
(http://blaGt.\\aiotl.Qdu.a)laGt/REi\DME.html) .- 

The paragraph beginning at page 312, line 33, has been amended as follows: 
-- ABI DYE-primerTM chemistry (PE Applied Biosystems, Foster City, CA; Cat. 
No.: 4021 12) was used to end-sequence the lambda and plasmid subclones. ABI DYE- 
terminaterTM chemistry (PE Applied Biosystems, Foster City, CA, Cat. No: 403044) was used 
to sequence the PCR products with their respective PCR primers. The sequences were collected 
with an ABI377 instrument. For PCR products larger than Ikb, walking primers were used. 
The sequences of contigs generated by the OSS strategy in AutoAssemblerTM (PE Applied 
Biosystems, Foster City, CA; Cat. No: 903227) and the gap-filling sequencing trace files were 
imported into SequencherTM (Gene Codes Corp., Ann Arbor, MI) for overlapping and editing. 
The sequences generated by the total shotgim strategy were assembled using Phred and Phrap and 
edited using Consed (http://chimera.biot e ch.waGhington.odu/uwgc/proj e ctG.htm) and GFP 
(Genome Reconstruction Manager for Phrap), version 1.2 (http://stork.cellb.bcm.tmc. e du/gfp/) ." 

The paragraph beginning at page 313, line 18, has been amended as follows: 
-The identification and characterization of coding regions was carried out as follows: 
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First, repetitive sequences were masked using RepeatMasker (A.F. A. Smit & P. Green^ 
http://ftp.g e nom e .waGhington. e du/RM/RM_dotailG.htni which screens DNA sequences in FastA 
format against a Ubrary of repetitive elements and returns a masked query sequence. Repeats not 
masked were identified by comparing the sequence to the GenBank database using 
WUBLAST2.0 [Altschul, S & Gish, W., Methods Enzymol. 266: 460-480 (1996)t 
http://blaGt.wuGtl. e du^laGt/RE ADME.html ] and were masked manually.-- 
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